RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Applicalion Serial Number 
Source 

4)ate Processed b>^STlC: 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO I IIK 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 7I)J 30.S-4212; FAX: 703-308^221 
. Effective 12/13/03 : TELEPHONE: 57 I -272-25 1 (1; FAX: 571-273-022! 

TO RKDllCI' EKRORF.I.) SROUf NCI- LISTINGS, PLEASE L'SH riH- CH E( KKK 
VERSION 4,1 PROGRAM , ACCILSSIIUT. THROUGH Till: U S IVXTENT AND 
TRADEMARK OFFICE WEBSITE SEl lU-EOW FOR ADDRESS 
h((i)://\vwvvaispto.uov/\vcl)/ofriC(\s/|):ic/clicckcr/clikr41iH)tc\l^ 

^ ■ 

Applic;inls submilting gcnclic sc(iiicntc inromiaiion cicclrom«4*+T on diskfTlc or CD-Roni slioiiltl he ilir 

;i pobsibililv dial the disk/CD-Roai iikiv luvc been .illeclcd bv ircnlinciU given lo ;i!l incoming mail 

Please consider using alternate inclhods of siibimssiuii lor ilic disk/CD-Roin or replacement disk/CI)-Ki)in 

Any re ply including a sequence hstinu m electronic Con n siiould N OT be sent to the 2023 1 /jp code a ddress .1 p r . ( 1 1 e 

United States Patent and Trademark Orfice, a nd instead siiould be sent via tiic following to tlic ind icated addresses 

1 FiFS Bio (<li((p://\v\vw.iis|)to.uov/chc/cfs/d()\vrilon(ls/(lociimcnts.htin> , EES Sul)missi(m 

User Manual - ePA VI-) 

2 U.S. Postal Scn'icc: Commissioner for Patcnti.. P C Box 1450. Alexandria. VA 22313 Pl.SO 

3 Hand Can y directly to (KFFECTIVE 12/01/03): 

U S Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 113*03, Cn'slal IMa/a I'wo. 
201 1 Soulii Clark Place, Arlmgion, VA 22202 
4. !'\-dcral Kxpress, United I'aicel Ser.vicc, or-otlicf tielivcry seiTicc Uk U'S. Patent and Tradciuaik OKiee. 
Box Sequence, Roam ^D()3-Mailrooln. Crystal Pla/^i Two. 201 1 South Clark Place. Arlington, VA 2:M02 





Revised 10/08/03 



Raw Sequence Listing Error Summaiy 



V1BER;* 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER;, 

ATTN: N^ RULES CASES: PLEASE DISRB<?ARD ENGLISH "ALPHA" HEADEI^^ WHICH WERE INSERTED BY PTO SOFTWARE 



10 



11 



12 




1 y Wrapped Nucleics The number/text at the end of each line "wrapped" down to the next line. This may occur if your Tile 
Wrapped Aminos was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping.'* 

Malid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



{/won 



Misaligned Amino The numbering under each S*** amino acid is misaligned. Do not use tab codes between numbers; 
Imbering use space characters, instead. 

ASCII The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Pl ease _ 

ensure your subsequent submission is saved in ASCII text. 

Variable Length Sequence(s) contain n*s or Xaa*s representing more than one i^sidue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

_PatentIn 2.0 . A "bug" in Patentin version 2.0 has caused.the <220>-<223> section to be missing from amino acid 

"bug" sequences(s) - Normally, Patentin would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

^Skipped Sequences Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(OLD RULES) (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION.SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 



Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



_Skipped Sequences 
(NEW RULES) 



Use of n's or Xaa's 
"(NEW RULES) 

Jnvalid <2I3> 
Response 



Sequence(s) 



missing. If intentional, please insert the following lines for each skipped sequence. 



<2I0> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1 .823 of Sequence Rules, use of <220>-<223> is MANDATORY if n*s or Xaa*s are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

* Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



Use of<220> 



Sequcnce(s) missing the <220> "Feature" and associated numeric identifiers and responses. 

Use of <220> to <223> is MANDATORY if <213> "Organism" responsris "Artificial Sequence" or 

"Unknown." Please explain source of genetic material in <320> to <223> section. 

(See "Federal Register," OGrOI/1998, Vol. 63. No.- 104, pp. 2963J;^2) (Sec. 1 .823 of Sequence Rules) 

iP'atentIn 2.0 Please do not use "Copy to Disk" function of Patentin version 2.0. This causes a corrupted file, ' 

"bug" resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 

listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



1 3 Misuse of n/Xaa "n" can only represent a single nucleotide ; "Xaa" can only represent a single amino acid 



AMC - Biotechnology Systems Branch - 09/09/2003 
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IFWO 



RAW SEQUENCE LISTING DATE: 03/18/2004 

PATENT APPLICATION: US/10/796,669 TIME: 15:02:36 

Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\03172004\J796669.raw 



W — > 
C— -> 
C— > 
W — > 
W — > 



3 <110> APPLICANT: Jin-Town Wang 

4 Tzu-Lung Lin 

5 <120> TITLE OF INVENTION: A TYPE II RESTRICTION ENDONUCLEASE AND APPLICATION THEREOF 

6 <140> CURRENT APPLICATION NUMBER: US/10/796 , 669 

6 <141> CURREN T FILING DATE: 2004-03-09 -<\!^ 
d (<130> FILE ^FERENCE r^^ -^^^-^^Cy \ D^'^^ h^^CC^ 
6 <160> NUMBER OF SEQ ID: 5 A 9 ^ 

1 <17 0> SOFTWARE: 



ERRORED SEQUENCES 

E~> 8 <210> SEQ ID NO 

9 <211> LENGTH : <^NGT 

10 <212> TYPE: DNA 

11 <213> ORGANISM: 
E~> 13 <400> SEQUENCE: 
E — > 14 ccatc 5 
E — > 16 <210> SEQ ID N04. 



MICROSOFT WORD 2000 

^^>^r<si^'^^ Not CompW ^ 





Helicobacter pylori 



corrected diskette 




17 <211> LENGTH : ( ^NGT H^ 617 , Cv^tW^ 

18 <212> TYPE: DNA '''''^^--<>t^^ c/^^^^^ 

19 <213> ORGANISM: ORGANISM: Helicobacter pylori 

21 <300> PUBLICATIO N INFORMATIO N 

22 ^30 8>. JlATTSESSg^CCE S SIGN 



pf^^oy^ 



W— > 24 <300^JTOLICATI^^^ 





/EMBL/Genbank;Accession No. :AB118944 



act aaa aaa'^Scg gc'a^cgC aaa att tta age ttt tcals 




E — > 



E — > 



E — > 



acc acc atg cga aac cct aaa 60 
aga ata gga caa ttt tta get gtt tta gga aag ttt gaa 

aat caa ate ctt aaa tct tea 120 
ata ate atg caa att ate aaa tec gtt ttg get eat agg 

ctt tat aga cct act tct etc 180 
aat caa aat aaa gaa ttg aaa gaa aaa ttt gac tec aat 

gaa tat gtc ttt age gat gaa 240 
gag tta gaa cgc att ata gaa ata tec eca caa aat cat 

aaa gaa atg ggc ttt gag cat 300 
gga tgg gaa age egg ttt gac act tgg tat aag ctt atg 

tgt gag ttt ggt ttt tgc tac 360 
tat gea aaa tat gag aaa ata etc ate age gat age get 

aag atg ctt att ctt get tat 420 
tac aat aaa gaa aac gat get ttt aaa gaa age gtt gat 

gaa age gta gtt ggg get ata 4 80 
ttt tta aac get etg tct aaa tat gaa gta gga aac cct 
tac aaa aa 



7: 



,g „t t ta aac cat 54 0 ^ ^T^alHi A ^ |0 , feC^-^gS 



file://C:\CRF4\Outhold\VsrJ796669.htm tfi^A^i^ 
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RAW SEQUENCE LISTING DATE: 
PATENT APPLICATION: US/10/796 , 669 TIME: 

Input Set : A:\PTO.DA.txt 
_Out£ ut Se t: N:\CRF4\03172004\J796669.raw 



03/18/2004 
15:02:36 



E — > 

E~> 

E — > 

E — > 

E— > 

E — > 

E— > 

E — > 

E~> 

E — > 

E~> 

E~> 

E— > 

E — > 

E — > 

E— > 

E — > 

E~> 
E~> 
E — > 



E — > 



lac aac cct ttc aaa eta ttg etc teg ett tta aaa egar 

53/ etc aaa aat gcc cat eta ace 600 

55/ ccc eta tet gte aaa gaa ate cet att tta ett tgt tgg 

aaa gac gat aac get aat ggg 660 
ett tat gac tae att att cgt tta aga caa gaa ate gtt 

act ate aat aaa aca gaa ttc 720 
age tae tea gat gaa ttt ate tat gaa aaa tge eta aaa 

ett tta gaa agt gtt aat aaa 780 
aca ega ttt aaa atg age caa ate act aac gaa gee gtt 

gat gaa tae att aga aaa atg 840 
cgt att aca gga ett att tea ttg egt ggt aat ggt agg 
68/ ttt att gat att aat act aat 900 

70/ gaa aat aat aaa ata gat tae att tta caa ace eat aag 

71/ get ttt aaa ggg gat tat tta 960 

73 aac gae act caa get aac aaa etc gee ttt ttt aac tae 

7^ atg gcg ate gtg gat age ttt 1020 

7(; ett gtt agt gtt act eea ate age get aat gag age gtt 

7'' aaa tea age aaa ttg aat gaa 1080 

7?> eta gca aac act tat act aaa gat ttt ate aag eaa gaa 

8(' tta etc att act tgt aac aag 1140 

85 eaa gaa tea aaa gat agt ttt tta aga etc att gat aaa 

82 cct tta egc tta gaa ttt tta 1200 

85 age get att ttc ttg aaa eaa eat ttt gaa aat tta age 

86 gtg ata ccc aat tat aaa age 1260 

88 gat gat gaa gge ttg ecc gta tae aca gca age ggt aat 

8^' aaa cct gat att gta get atg 1320 

9% gac aca aaa gcc caa agt tat ata gaa gtg age ttg att 

aga gae aga agt eaa agt ace 1380 
ttg gaa atg ata cct att gee aga cat tta aaa gaa ttg 



94 

95 
97 

98 



att aaa aat age ace gat att 1440 
aga gaa aaa ttt agt gtt ttt gta get eea aat ate eat 

gat gat gcc aaa gaa tat gcg 1500 
lOd gaa ttt gee eaa ttc aaa gae aat att aat ata tgt tgt 

101 tat get att aat gat ttt ate 1560 

lOBl^aa aaa gta gaa aac age ata gaa tgg tta cag ate aat 

104( ^,gac eat ttg aaa g ^ taa 1 6 17 

107 <2To>''"sEQ"^irrTJo^ — — 

108 <211> LENGTH :(^NGTPn^38p^ (AU/^C^/\Mj^f^^ 

109 <212> TYPE: PRr^^r^rqt^^''^*^ ^ ' 

110 <213> ORGAN ISM: (QRGAnTs^ 

113 <400> JEfiUENCE^^^^^NS^i^^ 
114/Met "fhr Lys Lys Pro Ala Arg ' Lys 

Thr Thr Met 
1 5 1.0 



Arg Asn Pro Lys Arg lie Gly Gin Phe Leu Ala Val Leu 

Gly Lys Phe — — 

ge-^ __25 ^ 30 

Glu Asn Gin lie Leu Lys Ser Ser He He Met Gin He 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/796,669 

Input Set : A:\PTO.DA.txt 



DATE: 03/18/2004 
TIME: 15:02:36 



^^^-.--^ "^lEpTrt-Set-:---^ACRF4\0317^^^ 

/124,/lle Lys Ser 
125 35 40 45 

127 Val Leu Ala His Arg Leu Tyr Arg Pro Thr Ser Leu Asn 

128 Gin Asn Lys 

129 50 55 60 

131 Glu Leu Lys Glu Lys Phe Asp Ser Asn Glu Tyr Val Phe 

132 Ser Asp Glu 

133 65 70 75 

134 80 

136 Glu Leu Glu Arg lie lie Glu lie Ser Pro Gin Asn His 

137 Lys Glu Met 

138 85 90 95 

14 0 Gly Phe Glu His Gly Trp Glu Ser Arg Phe Asp Thr Trp 

141 Tyr Lys Leu 

142 100 105 110 

^144 Met Cys Glu Phe Gly Phe Cys Tyr Tyr Ala Lys Tyr Glu 

45 Lys lie Leu 

46 115 120 125 

ll4 8 He Ser Asp Seir Ala Lys Met Leu He Leu Ala Tyr Tyr 

149 Asn Lys Glu 

IfSO 130 135 140 

152 Asn Asp Ala Phe Lys Glu Ser Val Asp Glu Ser Val Val 

153 Gly Ala He 

/l54 145 150 155 . 

/l55 160 

157 Phe Leu Asn Ala Leu Ser Lys Tyr Glu Val Gly Asn Pro 

158 Tyr Lys Lys 

159 165 170 175 

161 Asn Leu Asn His Asn Asn Pro Phe Lys Leu Leu Leu Ser 

162 Leu Leu Lys 

163 180 185 190 

165 Arg Leu Lys Asn Ala His Leu Thr Pro Leu Ser Val Lys 

166 Glu He Pro 

167 195 200 205 

169 He Leu Leu Cys Trp Lys Asp Asp Asn Ala Asn Gly Leu 

170 Tyr Asp Tyr 

171 210 215 220 

173 He He Arg Leu Arg Gin Glu He Val Thr He Asn Lys 

174 Thr Glu Phe. 

175 225 230 235 

176 240 

178 Ser Tyr Ser Asp Glu Phe He Tyr Glu Lys Cys Leu Lys 
^79 Leu Leu Glu 
L80 245 250 255 

^82 Ser Val Asn Lys Thr Arg Phe Lys Met Ser Gin He Thr 
1^3 Asn Glu Ala 

184 260 265 270 

18 5 Val Asp Glu Tyr He Arg Lys Met Arg He Thr Gly LeM-"" 




E — > 



187^s^e Ser Leu 
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RAW SEQUENCE LISTING DATE: 03/18/2004 

PATENT APPLICATION: US/10/796,669 TIME: 15:02:36 

Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\03172004\J796669.raw 



E— > 188 275 ^^^280^' 285 

1/90 Ar^-^ly Asn Gly Arg Phe lie Asp lie Asn Thr Asn^ 
E — > i91/iVsn Asn Lys 



E--> il9i 290 295 



E-/> 



300 



/194 lie Asp Tyr lie Leu Gin Thr His Lys Ala Phe Lys Gly 

195 Asp Tyr Leu 

196 305 310 315 

197 320 

200 Asn Asp Thr Gin Ala Asn Lys Leu Ala Phe Phe Asn Tyr 
E — 201 Met Ala He 
E — > 202 325 330 335 

204 Val Asp Ser Phe Leu Val Ser Val Thr Pro He Ser Ala 
E — 5f 205 Asn Glu Ser 
E~>| 206 340 345 350 

j 208 Val Lys Ser Ser Lys Leu Asn Glu Leu Ala Asn Thr Tyr 
E — >l 209 Thr Lys Asp 
E— >| 210 355 360 365 

i 212 Phe He Lys Gin Glu Leu Leu He Thr Cys Asn Lys Gin 
E — >! 213 Glu Ser Lys 
E~> 214 370 375 380 

216 Asp Ser Phe Leu Arg Leu He Asp Lys Pro Leu Arg Leu 
E — > 217 Glu Phe Leu 
E~> 218 385 390 395 

E — > 219 400 

221 Ser Ala He Phe Leu Lys Gin His Phe Glu Asn Leu Ser 
E — > 222 Val He Pro 
E~> 223 405 410 415 

/ 225 Asn Tyr Lys Ser Asp Asp Glu Gly Leu Pro Val Tyr Thr 
E--^> 226 Ala Ser Gly 
E-f> 227 420 425 430 

/ 22 9 Asn Lys Pro Asp He Val Ala Met Asp Thr Lys Ala Gin 
E-T-> 230 Ser Tyr He 
Ef-> 231 435 440 445 

/ 233 Glu Val Ser Leu He Arg Asp Arg Ser Gin Ser Thr Leu 

— > 234 Glu Met He 

— > 235 450 455 460 

237 Pro He Ala Arg His Leu Lys Glu Leu He Lys Asn Ser 
S — > 238 Thr Asp He 
C— > 239 465 470 475 

E — > 240 480 

\ 243 Arg Glu Lys Phe Ser Val Phe Val Ala Pro Asn He His 
E\ — > 244 Asp Asp Ala 
E^-> 245 485 490 495 

\ 247 Lys Glu Tyr Ala Glu Phe Ala Gin Phe Lys Asp Asn He 
E-\> 248 Asn He Cys ' 
E~> 249 500 505 510 - 

^251 Cys Tyr Ala He Asn Asp Phe He Lys Lys -'Val Glu 

E — > 25? Ser He Glu 

520 525 ' 



-> 252 
E-— > 253^515 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/796,669 



DATE: 03/18/2004 
TIME: 15:02:36 



E — > 




Input _Set_j,.^\.ETQ.,JD.aL:t^^ 
^Output Set : N: \CRF4\ 031 7*20 Or4\nJ7-Sj5669 . raw 



Trp Leu Gin lie Asn Asp His Leu Lys Ala 
530 535 





<211> ^^^^^ , 

<212> TYPE: DN7r^_- 

<213> ORGANISM: (PRGANISmT) Helicobacter pylori 



age 



atV 



264 <30Q > PUBLICATION INFORMAXXQ N^ 

265 ^^^V^TABASE AC^IE-S^ON NQ15 dDBJ/EMBL^^ No.:AB118944 
267 p<300> PUgtX€^^ON INFORMATfoN: 4 

268 /atg-'^gg caa gac get gat ttt aaa gcg ctt gaa ^aa ctg 

'aaa gaa tac ttt aat caa get 60 

tta aag eta gaa gaa aat tat ttt age caa cat ttt 

aac aag ttt ttc age tat aaa 120 
gat tgt gtc aaa ate ggt age att aga gag cat ata 

age tta aac tta gat aaa tta 180 
aat aaa gat att tta tta aca age ctg att tat tea 

gat aag ata get aac acg gta 24 0 
ggg cat tat gaa get tat agg aaa aaa gag att ttg caa 

gat aga ttt att ttt gag ctt 300 
att age ect ata aaa eat gat aaa aat ate atg ata gag 
aga aaa aac get aac gaa ttg 360 

286 get aaa acc tta aaa ata gac tta gtc ttt att gat ect 

287 cea tac aat tea agg caa tac 420 

289 age egg ttt tat eat etc tat gaa aac eta gtg cag tgg 

290 aaa aaa ccc aaa etc tat gga 480 

292 aca get tta aag cca tea tgc gag aac atg age gaa tat 

293 tgc cgc tct aat gcc aag aaa 540 

295 gaa ttg age gat tta att gaa aaa eta gat tgt aaa agg 

296 att get tta act tat aat aat 600 

298 acc tat aac tct aag tct age tct teg caa aat aaa ata 

299 ggc ttt aaa gat tta gtg gaa 660 

301 att ttg agt caa aaa gga aaa tta age gtt aaa gaa aag 

302 get cat agt ttt ttt aat tea 720 
304 gga aaa act gat ttt aaa gag cat aaa gaa ttt tt 

v305 ata gtg gaa gtg_aaa ect tga 780 
<210> SEQ ID NC 



/O 



3L8 



ID NO: 5 



?11> 



<2] 



LENGTH:/ 



311 <213> ORGAN I SM : C oRG ANI S^? ) Helicobacter pylori 

313 <30,Q^> PUBLICATION INFORMAT ION 

3 1 4 ^508> DATABASE A C CE 331^15155^ 0 &Ba/£M^^ / Genba n k ; Ac ce s s i on No 
3^^6 < JO 0> yUBLiCATJOir 3lSi5iRS!AT ION : SEQUENCET5'^^ 
317 atg cca caa-'lctc aat aag eta ttc cca aat aac a^'b-vaat 

-^318 caa ttt^tt gag ect ttt gtg 60 

320 ggt ggg ggt age gtg ttt tta aac act aag get aag aga 

321 tac tta get aat gac ata gat 120 

323. act aat att ate aat tta cat aaa act tta age aag ttc 

324 aat gtt tgt gag ctt ttt gat 180 

326 gaa ttg tct aaa att ate att eat tat gge tj;;g'--t?5t ttc 



:AB118944 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/796 , 669 



DATE: 03/18/2004 
TIME: 15:02:36 



Input Se t : I^TVRTO . DA . txt 



Output Set: N:\CRF4\Q3172004\J796669.raw 



32'7 tct ttt aag ggg att atg gcc 240 
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VERIFICATION SUMMARY DATE: 03/18/2004 

PATENT APPLICATION: US/10/796,669 TIME: 15:02:37 

Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\03172004\J796669.raw 



L:5 M:283 W: Missing Blank Line separator, <120> field identifier 
L:6 M:270 C: Current Application Number differs. Replaced Current Application No 
L:6 M:271 C: Current Filing Date differs. Replaced Current Filing J)ete 
L:0 M:201 W: Mandatory field data^jjiis.sing, <130> FILE R^ F^B^NGEl^ 
L:6 M:283 W: Missing~BTank Line separator , <160> field identifier 
L:8 M: 212^1. (34) Invalid or duplicate Sequence ID Number, SEQ ID NO' 
L:8 M:283 W: Missing Blank Line separator, <210> field identifier 
L:t3 M:212 E: ( 3JJ_Lnv.alld_.^Qr_duplica te Se quen ce ID Number, SEQ ID NO 
L:14 M:252 E: No. of_S.ea..^di.££e_rs , <211> LENGT ^HOnput : 0 Found k S SEQ _lQ. 

L : 16^ : 2 12 Ej ( 3 41,,Iiwa44d--e-r-.4i ^ SeauenceTTr iSfui mjer, "SE^ I D NO 

L ; 2 4 M : 2 5 6' W : I nva 1 i d Numeric; H^d£r--Ei4eXd.,„Lden.t-i±d-e-:^-<.3L^ Expecte d^, SEQ : 0 
lV2T^:_212 E: ( 3 4J__Inval^d_ o r__du^^ Sequence ID Number, SEQ ID N£ ' 

L:,25lMj"254 E: No. oF^Ba^s-'ee^^f^^^^^ 

M:254 Repeated in SeqNo=0 ~ ~ ' — - 

L:104 M:252 £.:^No. of Seq. differs, <211> LENGTH : I nputj_0J^ound: 1617 SEQiO 
L:107 M:212 E: (34) Invalid or duplicate Sequenc^e....I.D. N.umberT^rsllQ., T^I^^ 



L:113 M:212'^ 
L:115 M:332 E 



134) Invalld.^.or---d-upi-±effle"^ SEQ ID NO 

(32) Invali(iZMissin.g..-ArainD_AGid_Nu SEQ ID:0 . 

M:332 Repeated in SeqNo=0 ' — 

L:256 Mi.252.„E: No. of Seq. differs, <211> LENGTH : Input : O Found: 538 SEQ:0 
L:267 M<256_W: Invalid Numeric Header Field, Identifier <309> Expected, SEQ: 4* 
L:268 M:254-E: No. of Bases conflict, LENGTH : Input : 0 Counted:39 SEQ:4 
M:254 Repeated in SeqNo=4 

L:308 M:212^ (34) Invalid or duplicate Sequence ID Number, SEQ ID NO ^1--^ 
L:316 M:256 W: Invalid Numeric Header Field, Identifier <309> Expected, SEQiO 
L:316 M:212 £^(34) Invalid or'duplicate Sequence ID Number, SEQUENCE ID NOS:0 differs:- 
L:317 M:254 E:. No. of Bases conflict, LENGTH: Input : 0 Counted:39 SEQ:0 
M:254 Repeated in SeqNo=0 

L:362 M:252 E: No. of Seq. differs, <211> LENGTH : Input : 0 Found: 846 SEQ:(>^ 
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PCI PI/EX EC/22/6 
Annex Vn, page 29 




/o/7fG/((>0>'r (F) 





<110> 

<120> 

<130> 

<140> 
<141> 



<150> 
<151> 



<160> 

<170> 

<210> 
<211> 
<212> 
<213> 



<220> 
<221> 
<222> 

<300> 
<301> 

<302> 

<303> 
<304> 
<305> 
<306> 
<307> 
<308> 
<309> 



Smith, John; Smithgene Inc. 
Example of a Sequence Listing 
01-00001 



PCT/EP98/00001 
1998-12-31 



US 08/999,999 
1997-10-15 
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<400> 1 

agctgtagtc attcctgtgt cctcttctct ctgggcttct caccctgcta atcagatctc 

agggagagtg tcttgaccct cctccgcctt tgcagctcca caggcaggcci ggcaggcagc 

tgatgtggca attgctggca gcgccacagg ctcttcagcc aggcttaggg tgggttccgc 



60 
120 
180 



cgcggcgcgg cggcccctct 



cgcgctcctc tcgcgcctct ctctcgctct cctctcgctc 



240 



PCIPmEXEC/22/6 
Annex VII, page 30 



ggacctgatt aggtgagcag gaggaggggg cagttagc atg gtt tea atg ttc age 296 

Met Val Ser Met Phe Ser 

1 5 

ttg tet tte aaa tgg cct gga ttt tgt ttg ttt gtt tgt ttg ttc caa 344 

Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu Phe Val Cys Leu Phe Gin 

10 15 20 

tgt ccc aaa gtc etc ccc tgt cae tea tea ctg cag ceg aat ctt ^ 389 

Cys Pro Lys Val Leu Pro Cys His Ser Ser Leu Gin Pro Asn Leu 

25 30 35 



<210> 2 

<211> 37 

<212> PRT 

<213> Paramecium sp, 

<400> 2 

Met Val Ser Met Phe Ser Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu 
15 10 15 

Phe Val Cys Leu Phe Gin Cys Pro Lys Val Lou Pro Cys His Ser Sor 
20 25 - 30 

Leu Gin Pro Asn Lou 

35 . 

<210> 3 

<211> 11 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Designed peptide based on size and polarity to act as a 
linker between the alpha and beta chains of Protein XYZ . 

<400> 3 • 

Met Val Asn Leu Glu Pro Met His Thr Glu lie 
1 5 10 

<210> 4 
<400> 4 
000 



[Annex VIII follows] 



